Knock-Knock: Acoustic object recognition by using stacked denoising autoencoders
نویسندگان
چکیده
This paper presents a successful application of deep learning for object recognition based on acoustic data. The shortcomings of previously employed approaches where handcrafted features describing the acoustic data are being used, include limiting the capability of the found representation to be widely applicable and facing the risk of capturing only insignificant characteristics for a task. In contrast, there is no need to define the feature representation format when using multilayer/deep learning architecture methods: features can be learned from raw sensor data without defining discriminative characteristics a-priori. In this paper, stacked denoising autoencoders are applied to train a deep learning model. Knocking each object in our test set 120 times with a marker pen to obtain the auditory data, thirty different objects were successfully classified in our experiment and each object was knocked 120 times by a marker pen to obtain the auditory data. By employing the proposed deep learning framework, a high accuracy of 91.50% was achieved. A traditional method using handcrafted features with a shallow classifier was taken as a benchmark and the attained recognition rate was only 58.22%. Interestingly, a recognition rate of 82.00% was achieved when using a shallow classifier with raw acoustic data as input. In addition, we could show that the time taken to classify one object using deep learning was far less (by a factor of more than 6) than utilizing the traditional method. It was also explored how different model parameters in our deep architecture affect the recognition performance.
منابع مشابه
An Efficient Method for Knock Signal Denoising in Spark Ignition Engine
One of the factors that affects the efficiency and lifetime of spark ignited internal combustion engine is “knock”. Knock sensor is a commonly used to detect this phenomenon. However, noise, limits detection accuracy of this sensor. In this study, Empirical Mode Decomposition (EMD) method is introduced as a fully adaptive signal-based analysis. Then, based on weighting decomposition...
متن کاملDecoding Stacked Denoising Autoencoders
Data representation in a stacked denoising autoencoder is investigated. Decoding is a simple technique for translating a stacked denoising autoencoder into a composition of denoising autoencoders in the ground space. In the infinitesimal limit, a composition of denoising autoencoders is reduced to a continuous denoising autoencoder, which is rich in analytic properties and geometric interpretat...
متن کاملImproved Knock Detection Method Based on New Time-Frequency Analysis In Spark Ignition Turbocharged Engine
Premature combustion that affects outputs, thermal efficiencies and lifetimes of internal combustion engine is called “knock effect”. However knock signal detection based on acoustic sensor is a challenging task due to existing of noise in the same frequency spectrum. Experimental results revealed that vibration signals, generated from knock, has certain frequencies related to vibration resonan...
متن کاملMid-level Features for Audio Chord Estimation using Stacked Denoising Autoencoders
Deep neural networks composed of several pre-trained layers have been successfully applied to various tasks related to audio processing. Stacked denoising autoencoders represent one type of such networks. They are discussed in this paper in application to audio feature extraction for audio chord estimation task. The features obtained from audio spectrogram with the help of autoencoders can be u...
متن کاملMarginalized Stacked Denoising Autoencoders
Stacked Denoising Autoencoders (SDAs) [4] have been used successfully in many learning scenarios and application domains. In short, denoising autoencoders (DAs) train one-layer neural networks to reconstruct input data from partial random corruption. The denoisers are then stacked into deep learning architectures where the weights are fine-tuned with back-propagation. Alternatively, the outputs...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Neurocomputing
دوره 267 شماره
صفحات -
تاریخ انتشار 2017